Single-Step Quantum Search Using Problem 

Structure 

Tad Hogg 
Xerox Palo Alto Research Center 
3333 Coyote Hill Road, Palo Alto, CA 94304 
hogg@parc.xerox.com 

February 1, 2008 



Abstract 

The structure of satisfiability problems is used to improve search algorithms 
for quantum computers and reduce their required coherence times by using only 
a single coherent evaluation of problem properties. The structure of random 
k-SAT allows determining the asymptotic average behavior of these algorithms, 
showing they improve on quantum algorithms, such as amplitude amplification, 
that ignore detailed problem structure but remain exponential for hard prob- 
lem instances. Compared to good classical methods, the algorithm performs 
better, on average, for weakly and highly constrained problems but worse for 
hard cases. The analytic techniques introduced here also apply to other quan- 
tum algorithms, supplementing the limited evaluation possible with classical 
simulations and showing how quantum computing can use ensemble properties 
of NP search problems. 



1 Introduction 



Quantum computers || [L7], 0, [L^, [2(], [39|, |43| offer the possibility of faster combina- 
torial search by operating simultaneously on all search states. For instance, quantum 



computers can factor integers in polynomial time ^7|, a problem thought to be in- 
tractable for classical machines. 

At first sight, quantum computers seem particularly well-suited for NP search 
problems [22| due to their efficiently-computable test of whether a given search state 
is a solution. Quantum computers can apply this test to exponentially many search 
states in about the same time as a conventional ("classical") computer tests just one 
and a variety of search algorithms have been proposed f| § 
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However, extracting a definite answer from this simultaneous evaluation appears to 
still give an exponentially growing search cost in the worst case 0. 

For general NP searches, amplitude amplification using a test of whether a 
search state is a solution, quadratically improves performance of heuristics consisting 
of many independent trials ||. This is the best possible improvement for "unstruc- 
tured" quantum methods, i.e., those using only such a test j|. Moreover, this tech- 
nique does not apply to more complex heuristics, e.g., those involving backtracking, 
tabu lists, parameters adjusted based on unsuccessful trials, cached nogoods or other 
forms of learning, abstraction or extensive preprocessing. Such heuristics often pro- 
vide the best known performance, at least on average, for a variety of combinatorial 
searches. A focus on typical behavior of large problems is important because often 
the worst cases are far harder than most instances encountered in practice. 

Thus, as a practical matter, there remain the questions of whether using problem 
structure in quantum algorithms can give more than quadratic improvement for the 
heuristics consisting of independent trials, any improvement at all for other types of 
heuristics, and less than exponential cost for at least some typical problems arising in 
practice. As one example, further improvement is possible for a quantum algorithm 



using detailed information on the distance of search states to solutions p8| , but in 
practice, such information is not readily available for most searches. 

Addressing these questions requires developing algorithms using problem struc- 
ture and determining their behavior for large problems. As with classical heuristics, 
such algorithms are often difficult to analyze theoretically due to complicated depen- 
dencies among successive search choices. Thus one is often forced to use empirical 
evaluation with a sample of problems. While quite common for evaluating classi- 
cal heuristics, this approach is limited to small problems for quantum algorithms on 
current machines due to the exponential increase in time and memory required for 
the classical simulation. Another approach, applied in this paper, evaluates average 
behavior over a simple ensemble of problems. Such ensemble-based analyses provide 



insight into typical behavior for large problems [51 



An extreme case is single-step quantum search, i.e., algorithms using only a single 
evaluation of structure associated with a problem. Single-step search is very effec- 
tive for highly constrained problems [|3T|, outperforming both unstructured quantum 
search and classical heuristics in these cases. Can the technique used for highly con- 
strained problems be extended to the more challenging case of hard search problems 



with an intermediate number of constraints [12, |33|? Conversely, to what extent does 
the restriction to a single step limit the extent to which the capabilities of quantum 
computers can be used? 

Single-step search is particularly well-suited for an ensemble-based analysis, since 
it avoids the dependencies found in multistep quantum algorithms or classical heuris- 
tics. Furthermore, single-step methods require far less coherence time than the un- 
structured algorithm with its exponentially many steps, and hence should be easier 
to implement. This is because maintaining coherence over many computational steps 
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is difficult p8|, |50|, |29|, f40f . While this difficulty is unlikely to be a fundamental limi- 



tation H |46|, |37j and small quantum computations have been implemented [[14], [T3|] 



algorithms that minimize the required coherence time simplify hardware implemen- 
tation. 

This paper gives an ensemble analysis for a single-step algorithm using the number 
of conflicts in each search state. We evaluate the asymptotic average scaling behavior 
directly, rather than relying on simulations. This result allows optimizing the algo- 
rithm, and also demonstrates a general technique for studying the average behavior 
of quantum search algorithms. We compare the asymptotic predictions to evaluations 
of small cases accessible to simulation, showing good correspondence even for small 
problems. 

In the remainder of this paper, we first summarize the NP-complete satisfiability 
search problem and then describe a class of one-step quantum algorithms for it. This 
class includes both the previous unstructured and highly constrained methods as 
special cases. We identify the best performing algorithms for satisfiability problems 
with differing degrees of constraint in the following two sections. We then present 
some additional behaviors of the algorithm and briefly consider extensions to more 
complex algorithms suggested by these results. Details of the derivation are in the 
appendices. 

As a note on notation, to compare the growth rates of various functions we use p5 | 
/ = O (g) to indicate that / grows no faster than g as a function of n when n — > oo. 
Conversely, f = Q(g) means / grows at least as fast as g, and f = Q (g) means both 
functions grow at the same rate. 



2 Satisfiability 

Satisfiability (SAT) is a combinatorial search problem |22| consisting of a logical 
propositional formula in n variables Vj, . . . , V n and the requirement to find a value 
(true or false) for each variable that makes the formula true. This problem has 
N = 2 n assignments. For fc-SAT, the formula consists of a conjunction of clauses 
and each clause is a disjunction of k variables, any of which may be negated. For 
k > 3 these problems are NP-complete. A clause with k variables is false for exactly 
one assignment to those variables, and true for the other 2 k — 1 choices. An example 
of such a clause for k = 3, with the third variable negated, is V\ OR Vi OR (NOT 
V3), which is false for {V\ = false, V 2 = false, V 3 = true}. Since the formula is a 
conjunction of clauses, a solution must satisfy every clause. We say an assignment 
conflicts with a clause when the values the assignment gives to the variables in the 
clause make the clause false. For example, in a four variable problem, the assignment 

{Vi = FALSE, V 2 = FALSE, V 3 = TRUE, K 4 = TRUE} 
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conflicts with the k = 3 clause given above, while 



{ Vi = FALSE, V 2 = FALSE, V 3 = FALSE, V 4 = TRUE} 

does not. Thus each clause is a constraint that adds one conflict to all assignments 
that conflict with it. The number of distinct clauses m is then the number of con- 
straints in the problem. 

The assignments for SAT can also be viewed as bit-strings with the correspon- 
dence that the i th bit is or 1 according to whether V is assigned the value false or 
true, respectively. In turn, these bit-strings are the binary representation of integers, 
ranging from to 2 n — 1. For definiteness, we arbitrarily order the bits so the values 
of Vi and V n correspond, respectively, to the least and most significant bits of the 
integer. For example, the assignment 

{Vi = FALSE, V 2 = FALSE, V 3 = TRUE, V 4 = FALSE} 

corresponds to the integer whose binary representation is 0100, i.e., the number 4. 

For bit-strings r and s, let \s\ be the number of 1-bits in s and r A s the bitwise 
AND operation on r and s. Thus \r A s\ counts the number of 1-bits both assignments 
have in common. We also use d(r, s) as the Hamming distance between r and s, i.e., 
the number of positions at which they have different values. These quantities are 
related by 

d(r, s) = \r\ + \s\ — 2\r A s\ (1) 

Let c(s) be the number of conflicts for assignment s in a given SAT problem. 

An example 1-SAT problem with n = 2 is the propositional formula (NOT Vi) 
AND (NOT V 2 ). This problem has a unique solution: {Vi = false, V 2 = false}, 
an assignment with the bit representation 00. The remaining assignments for this 
problem have bit representations 01, 10, and 11. 

Theoretically, search algorithms are often evaluated for the worst possible case. 
However, in practice, search problems are often found to be considerably easier than 



suggested by these worst case analyses 153]. This observation leads to examining 



the typical behavior of search algorithms with respect to a specified ensemble of 
problems, i.e., a class of problems and a probability for each to occur. A useful 
ensemble is random fc-SAT, specified by the number of variables n, the size of the 
clauses k and the number of distinct clauses m. A problem instance is created by 



randomly selecting m distinct clauses from the set of all possible clauses |41| . When 
n is large, the typical behavior of random /c-SAT is determined by fi = m/n, the ratio 
of clauses to variables. In particular, for each k there is a threshold value /z cr it on fi 
below which most random /c-SAT problems are soluble and above which most have 
no solutions [T3|, 13 1. For k = 3, this value is approximately ix cr - lt = 4.2. 



The quantum searches considered here are incomplete methods, i.e., they can find 
a solution if one exists but can never guarantee no solution exists. For studying such 
algorithms, the ensembles would ideally contain only instances with a solution. For 
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example, we could consider the ensemble of random soluble fc-SAT, in which each 
instance with at least one solution is equally likely to appear. Unfortunately, this 
ensemble does not have a simple expression for the number of problems as required 
for the analytic performance evaluation given below. Instead, for fi < fi CT a, most 
random problems indeed have a solution so random fc-SAT is useful for studying 
incomplete search methods for underconstrained problems. 

Randomly selected overconstrained problems usually have no solutions so random 
SAT is not a useful ensemble when \i > /z crit . An alternative with simple analytic 
properties is the ensemble with a prespecified solution. In this particular as- 

signment is selected to be a solution. Then the m clauses are selected from among 
those that do not conflict with the prespecified solution. Compared to random se- 
lection among soluble problems, using a prespecified solution is more likely to pick 
problems with many solutions, resulting in somewhat easier search problems, on av- 
erage. 

Each clause in a fc-SAT formula conflicts with exactly one of the 2 k possible 
assignments for the variables that appear in the clause. Thus the average number 
of conflicts in an assignment is c avg = m/2 k . While this average is the same for all 
SAT problems with given m and k, the variance in the number of conflicts varies from 
problem to problem. As described in Appendix [A], the the variance for random /c-SAT 
is c avg (l — 2~ k ). Thus when m 3> 1, the relative deviation decreases as 0(1/ ^/m) 
and hence the number of conflicts in most assignments is very close to the average. 

For random fc-SAT, the expected number of solutions is [|5l| 



(S) = 2 n (l - 2- k ) m = 2 n exp(/inlog(l - 2~ k )) (2) 



3 Quantum Search Algorithms 

Quantum computers use physical devices whose full quantum state can be controlled. 
For example [pjj, an atom in its ground state could represent a bit set to 0, and 



an excited state for 1. The atom can be switched between these states and also be 
placed in a uniquely quantum mechanical superposition of these values, which can be 

denoted as a vector ^ , ^ , with a component (called an amplitude) for each of the 

corresponding classical states for the system. These amplitudes are complex numbers. 

A quantum machine with n quantum bits exists in a superposition of the 2 n 
classical states for n bits. The amplitudes have a physical interpretation: when 
the computer's state is measured, the superposition randomly changes to one of the 
classical states with \ip s \ 2 being the probability to obtain the state s. Thus amplitudes 
satisfy the normalization condition J2 S l^sl 2 = 1- This measurement operation is used 
to obtain definite results from a quantum computation. 

Quantum algorithms manipulate the amplitudes in a superposition. Because 
quantum mechanics is linear and the normalization condition must always be sat- 
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isfied, these operations are limited to unitary linear operators. That is, a state vector 
if) can only change to a new vector if)' related to the original one by a unitary trans- 
formation, i.e., if>' — Uif> where U is a unitary matrix^ of dimension 2 n x 2 n . In spite 
of the exponential size of the matrix, in many cases the operation can be performed 
in a time that grows only as a polynomial in n by quantum computers |8|, |35], [M]. 
Importantly, the quantum computer does not explicitly form, or store, the matrix U. 
Rather it performs a series of elementary operations whose net effect is to produce the 
new state vector if)'. The components of the new vector are not directly accessible: 
rather they determine the probabilities of obtaining various results when the state is 
measured. 

Search algorithms for SAT problems use efficiently computed properties of indi- 
vidual assignments, e.g., a test of whether a given assignment is a solution. With 
quantum computers, these properties can be evaluated simultaneously for all assign- 
ments. In this paper we focus on algorithms that make use of this simultaneous 
evaluation just once. 



3.1 Single-Step Search 

Single-step methods could be implemented in a variety of ways. One simple approach 
starts with an equal superposition of all the assignments, adjusts the phases based on 
the number of conflicts in each of the assignments, and then mixes the amplitudes from 
different assignments. This algorithm requires only a single testing of the assignments, 
corresponding to a single classical search step. 

For a A;-SAT problem with n variables and m clauses, the algorithm takes the fol- 
lowing form. The initial state has amplitude if) s = 2~ n / 2 for each of the 2 n assignments 
s, and the final state vector is <f> = UPif> where the matrices P and U are defined 
as follows. The matrix P is diagonal with P ss = p c r s ^ depending on the number of 
conflicts c in the assignment s, ranging from to m. Because the number of conflicts 
in a given assignment is efficiently computable for SAT problems, these phase choices 
can be efficiently implemented . 

The mixing matrix is defined in terms of two simpler operations: U = WTW. 
The Walsh transform W has entries 

W rs = 2-™/ 2 (-l)l rAs l (3) 



for assignments r and s and can be implemented efficiently || |2"7|| . The matrix T 
is diagonal with elements T rr = t\ r \ depending only on the number of 1-bits in each 
assignment, ranging from to n. These definitions for W and T lead to a mixing 
matrix U whose elements U rs = UM r , s ) depend only on the Hamming distance between 

1 A complex matrix U is unitary when UW = I, where is the transpose of U with all elements 
changed to their complex conjugates. Examples include permutations, rotations and multiplication 
by phases (complex numbers whose magnitude is one). 
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the assignments r and s, with p2"f 



d n—d+z 



z=0 h=z 




Unlike previous algorithms, where phase choices are often just ±1, this algorithm 
potentially uses a different phase choice for each number of conflicts and each number 
of 1-bits in an assignment. 

This procedure defines a class of algorithms. A particular choice of the phases 
p c and th completes the algorithm's specification. For example, the choices po = 1, 
t = 1 and the remaining phases set to —1 gives a single step of the unstructured 
search algorithm [E7J. Another example is p c = i c and th = i h , appropriate for 



maximally constrained 1-SAT problems [^T|] 



3.2 Selecting Phase Values 

To determine appropriate choices for p c and th, we can evaluate the algorithm, via 
classical simulation, for samples of random SAT problems with small n using a variety 
of choices for p c and th- Numerical optimization of average performance with respect 
to these choices then identifies values giving high performance for random /c-SAT. 
These optimal values show logp c and logt^ vary nearly linearly with c and h over 
most of their range. This observation suggests that restricting consideration to such 
linear variation is likely to give a reasonable idea of the best such algorithms can 
perform, while simplifying the analysis. 

To further understand why such choices are appropriate for large fc-SAT problems, 
note that the number of assignments with h 1-bits is ftj. So for large n, most have h 
close to n/2. Similarly, for the phases p c , provided the number of clauses is large, i.e., 
m ^> 1, most assignments have nearly the average number of conflicts c avg = m/2 k . 
For large problems, we can expect the behavior of the phases near the average values 
will be the only important choices influencing the algorithm's behavior. Thus we 
consider an expansion around the dominant values of the form 



> ) +^(/.-|)+r( 2 )(/,-^) 2 + ...)) (5) 



th = exp m 



where the are constants, and similarly for p c . The first term in such an expansion 
just gives a constant overall phase factor for the amplitudes, which has no effect on 
the probability to find a solution, and so can arbitrarily be set equal to zero. The 
next term in the expansion, giving linear variation in phases, affects the solution 
probability. For assignments close to the average, this linear variation dominates the 
behavior. 

From both the empirical observations on optimal phase values for small problems 
and the increasing concentration of values for h and c and n increases, we are led 
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to consider a linear variation in the phase values. If, in spite of these motivating 
arguments, including some nonlinearity in the phase values improves the leading 
asymptotic behavior, a restriction to linear variation would still provide a lower bound 
on the possible performance of single-step algorithms. Thus we restrict consideration 
to phase choices described by two constants p and r with 

and 

The terms c avg and n/2 in these expressions just give an irrelevant overall phase to the 
amplitudes, but slightly simplify the analysis. As shown in Appendix [B| this choice 
for th gives 



u d = cos" ^— j tan" ^— j {-i 

For example, when r = 1/2, Ud = 2 - ™/ 2 (— i) d as used for solving 1-SAT problems [ |3Tf . 

Because c and h are integers, it is sufficient to consider values for p and r in 
the range —1 to 1. Since changing the sign of both p and r simply conjugates the 
amplitudes, we can further restrict consideration to r in the range to 1. 

Completing the algorithm requires particular choices for p and r. The asymptotic 
analysis given below identifies optimal choices for these parameters based on the 
values of k and m/n. 

3.3 Asymptotic Behavior for Random A;- SAT 

When the phase choices are particularly simple, as with the unstructured search al- 
gorithm P7| |, or the problem has a simple relation between Hamming distance from 



a solution and number of conflicts, as in 1-SAT problems pl| , the probability to ob- 
tain a solution, P so i n , has a simple analytic form. In more general cases, the solution 
probability can only be evaluated with a classical simulation, limiting the study of 
more complex algorithms or problem structures to relatively small sizes. A third ap- 
proach to evaluating P so i n is to average over an ensemble of problems. This approach, 
developed in Appendix [C| uses the structure of search problem ensembles to analyze 
the asymptotic behavior of the algorithm, and hence to select the best values for p 
and r. Specifically, for random fc-SAT, the average of P so i n scales as 

(P soln ) oc e -^(*./w) (9) 

where the decay rate A can be evaluated numerically for given choices of k, /i, p and 
r. 

Given the ability to numerically compute A, we can then optimize the performance 
of the algorithm, measured in terms of the average probability to find a solution, i.e., 
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minimizing A. This numerical optimization gives values for p and r appropriate to 
random fc-SAT with a given value of fi. 

In practice, implementation limitations will introduce some errors in the parame- 
ters. Fortunately, the precision required is not particularly strict because the param- 
eters appear in the exponents. In particular, an error of e in p or r will give error 
O (e 2 ) in the exponent. Thus only a square root precision in the implementation of 
the values of p and r is required. While by no means trivial, this shows the algorithm 
does not require exponentially precise parameter values to achieve the scaling. 

As an example, for the weakly constrained case in §|5|, such errors in parameter 
values gives exponential decay of exp(0 (e 2 ) m). To maintain © (1) behavior, e must 
be small enough that e 2 m = (1), i.e., the precision requirement is e = O (1/y/m). 
The precise scaling due to errors depends on the size of the second derivatives of 
A around the optimal p and r values. For k = 3, a root-mean-square combined 
error of e in p and r introduces at worst a factor exp(— 3.18e 2 m). This is greater 
than 1/2 provided e < 0.46/^/m. For example, weakly constrained problems with 
m = 2y/n satisfy this requirement for a 10,000 variable problem provided e < 0.033, 
which allows, roughly, a 10% error in the parameter values. Similarly, for m ^> n, such 
parameter errors give exponential decay of exp(0 (e 2 ) n), so the precision requirement 
is e = 0(l/y/n). 

3.4 Unstructured Search 

As a point of comparison with the one-step algorithm based on the number of conflicts 
in assignments, the unstructured search algorithm |27| applies amplitude amplification 
to random selection, giving a quadratic speedup after an exponentially large number 
of steps. Specifically, for a problem with n variables and S solutions, the probability 
in solutions after j steps is || sin 2 ((2j + 1)0) with sin(#) = J S/2 n . 

From Eq. (fj), for random fc-SAT with fixed p the fraction of assignments that are 
solutions, S/2 n , is exponentially small, and 9 ~ e /mi°g(i-2 )/2_ Thus for any fixed 
number of steps, j, the scaling of (P so i n ) is (e^ 10 ^ 1 ' 2 the same as random 
selection. This scaling is independent of the number of steps (provided j is constant). 
When j increases exponentially with n, specifically j = then the probability 

of a solution with this unstructured algorithm is (1). 

More generally, given a classical or quantum method consisting of independent 
trials, each of which produces a solution with probability p, amplitude amplification 
produces the quadratic improvement with sin(6 ) ) = ^fp. Thus the cost is (l/y/pj 
times the cost for a single trial. This general result means quantum computers can 
improve on many classical methods. However this quadratic improvement is also the 
best possible for quantum methods based only on testing whether assignments are 
solutions [|J. Moreover, some classical heuristics use information from unsuccessful 
trials to improve future ones, i.e., trials are not independent. Others spend most of 
their effort in preprocessing followed by rapid identification of the solution. In these 
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Figure 1: Smallest exponential decay rate A for (P so i n ) as a function of p = m/n for random 
3-SAT. For comparison, the gray curve shows the scaling for random selection. The points 
indicate empirical estimates of the decay rate for the fraction of soluble problems, a lower 
bound on the decay rate for P so i n . For large p, these estimates are difficult to obtain. A 
weaker lower bound, shown as the upper edge of the filled region, is given by the Markov 
bound using expected number of solutions for random 3-SAT. 



cases, even this quadratic improvement does not apply. 



4 Solving Hard Problems 

For random fc-SAT, the hardest problem instances are concentrated near a threshold 
value of p = m/n depending on k. For 3-SAT, this threshold is at /x = 4.2 ||15|| . 
Thus we examine the behavior of the single step algorithm when p is constant. In 
this case, the minimum decay rate A is shown in Fig. |I|. The corresponding best 
choices for p and r are shown in Fig. [| Note that the r values are less than 1/2 
which, from Eq. (|8|), means the Ud matrix elements are largest for small d, hence 
emphasizing the mixing at distances less than n/2 allowing the algorithm to exploit 
the clustering of assignments with relatively few conflicts in fc-SAT. These values, 
obtained by numerical minimization of A with respect to p and r, could be local 
minima. If so, other choices for p and r would give even better performance than the 
values reported here. For comparison, Fig. |l] shows the scaling of random selection, 
(S) /2 n , where S is the number of solutions, using Eq. @. 

An important observation from these results is even the best use of problem struc- 
ture based only on the number of conflicts cannot remove the exponential search cost 
with a single step algorithm of the type described here. 



The unstructured search algorithm |27j consists of applying amplitude amplifica- 



tion to random selection. Thus a second observation from Fig. [y is, for p less than 
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Table 1: Best parameter values and scaling behavior for single-step search of 3-SAT prob- 
lems for random and prespecified solution ensembles. 
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Figure 2: Optimal choices of r (black) and p (gray) as a function of fj, = m/n. Solid curves 
are for the random 3-SAT ensemble and the dashed curves are for the ensemble with a 
prespecified solution. 



about 3.5 (where A < —^-\og{{S) /2 n )), a single step with the optimal choices of 
p and r gives exponentially better performance, on average, than the unstructured 
search algorithm, which also requires coherence extending over multiple steps. Thus 
this analysis demonstrates how the structure of search ensembles can be exploited 
to improve quantum search performance and simultaneously reduce the required co- 
herence time. Moreover, by giving the actual asymptotic scaling this result is more 
definitive than prior empirical studies of algorithms based on classical simulations of 
small problems [ 30[ . 

Furthermore, the one-step algorithm can be combined with amplitude amplifica- 
tion || to achieve an additional quadratic improvement, corresponding to dividing 
the decay rate by a factor of 2 for soluble problems. This combination requires ex- 
tending coherence time beyond just one step, but because the reduced decay rate is 
then below that of the unstructured algorithm over the whole range of fi, not only is 
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performance better but the coherence time is still less than that of the unstructured 
algorithm. Thus this new algorithm improves on the unstructured one, on average, 
over the whole range of p. 

As another comparison, the cost of a good classical heuristic method is empirically 
observed |15[ to scale as 2™/ 19 ' 5 for random 3-SAT problems near p = 4.2, correspond- 
ing to A equal to log(2)/19.5 = 0.036. This scaling is better than the single-step 
quantum algorithm, which has A = 0.30 for p = 4.2. Even combined with amplitude 
amplification, reducing the decay rate to 0.15, the classical heuristic remains better. 

Beyond p = 4.2, the fraction of soluble problems, P so iubie, drops to zero as n 
increases for random 3-SAT. The performance of this algorithm for soluble problems 
is given instead by (P so in) /Psoiubie- Thus if P so iubie scales as e _am , the algorithm's 
decay rate for random soluble problems is A — u, and A is always at least as large 
as oj. Unfortunately, the random fc-SAT ensemble does not have a simple expression 
for P S oiubie, or even just its leading exponential scaling rate u>, precluding an exact 
evaluation of the behavior with respect to overconstrained soluble problems. 

One approach to estimate this behavior uses empirical classical search to evaluate 
insoluble for a range of problem sizes for a given value of p. The behavior of these 
values as a function of n then estimates uj. For instance, a study [44] using samples 
of 10 4 problems for n from 50 to 250 shows close to exponential decrease of P so iubie for 
p values somewhat above the transition. The resulting estimates of the actual decay 
rates for P so iubie are 0.011, 0.025 and 0.045 for p equal to 4.5, 4.7 and 4.9, respectively. 
These values are considerably smaller than the value of A for the one-step algorithm 
for these values of p, as shown in Fig. [I]. Nevertheless, the increase in uj accounts for 
most of the increase in A over this range of p values, i.e., the soluble problems are 
not continuing to get much harder for this one-step algorithm above the transition. 

This empirical technique is increasingly difficult to apply as fi increases due to 
the rapidly decreasing fraction of soluble problems in the ensemble [ 44| . For larger fi 
we can instead obtain a lower bound on the decay rate for P so iubie using the Markov 
inequality: P so iubie < (S). That is, this analysis averages over all problems in the 
ensemble, so (P so i n ) < Psoiubie- Thus, (P so i n ) < Psoiubie < (S), corresponding to 
A > uj > -J log (5). When fi > - log(2)/log(l - 2~ fc ) (equal to 5.19 for k = 3), 
Eq. (0) gives (S) -> as n -> oo so this becomes a nontrivial bound as shown 
in Fig. |l|. However, as a lower bound, this inequality cannot be used to determine 
whether overconstrained soluble problems indeed become easier to solve with the 
one-step algorithm as fi increases. 

An alternate approach uses an ensemble where all problems are soluble and that 
is analytically simple, e.g., the ensemble with a prespecified solution described in §[| 
The evaluation of (P so i n ) proceeds as described in Appendix 0, with the addition of 
needing to keep track of the distances of the assignments r, s and s' to the prespecified 
solution (which affects the available number of clauses that can be selected to produce 
the required numbers of conflicts). The resulting optimal behavior is shown in Fig. [3], 
with the corresponding best values for p and r given in Fig. |2|. The decay rate 
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Figure 3: Smallest exponential decay rate A for (P so in) as a function of // = m/n for 3-SAT 
with prespecified solution. For comparison, the gray curve shows the scaling for random 
selection. 



decreases as \i increases past 7, i.e., problems become easier as the number of clauses 
increases. Presumably a similar behavior would be seen for the soluble cases in 
the random ensemble as well because these two ensembles are fairly similar when 
\x is large. This observation of problems becoming easier as /i increases past the 
transtion corresponds to behavior seen with many classical methods. On the other 
hand, random selection and the unstructured algorithm do not improve as [i increases: 
they do not take advantage of the structure of highly constrained problems. 

Since classical simulations of quantum algorithms are limited to few variables, this 
asymptotic analysis can also indicate the extent to which these simulations match the 
asymptotic behavior. An example is Fig. |] showing that the behavior matches that 
from Table [I] for fi = 2 and /i = 4 even with a small number of variables. This 
suggests the limited sizes accessible with classical simulation may nevertheless be 
sufficient to indicate asymptotic behavior, as is also seen in some studies of classical 
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5 Solving Weakly and Highly Constrained Prob- 
lems 

The behavior of the minimum decay rate shown in Fig. [I] and |3] suggests that A 
decreases toward zero for small and large values of [i for soluble problems. This is 
confirmed in Fig. [5| which shows the behavior of both ensembles over a larger range 
of values. 

As fi — > 0, the figure shows the minimum decay rate is nearly a straight line with 
slope 2 on this log-log plot, indicating A = B (/x 2 ) in this limit. With this limiting 
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Figure 4: Optimal asymptotic behavior of (P so in) for 3-SAT with u = 2 (gray) and 4 (black) 
on a log-scale vs. n. The points show the exact values of (P so in)- 




0.01 0.1 1 10 100 1000 10000 

m/ n 



Figure 5: Limiting behaviors, on a log-log plot, for exponential decay rate A. Behavior 
for random 3-SAT (black curve, up to u = 6), prespecified solution 3-SAT (gray curve, for 
u between 0.1 and 40) and the upper bound based on probability to find the prespecified 
solution (black curve, for u > 20). For comparison the dashed lines show the limiting 
behaviors for small and large fj,, which correspond very closely to the exact values for 
H < 0.3 and a > 1000. 

behavior (P so i n ) oc e~ An with An = Q{m 2 /n). In particular, if m grows no faster 
than y/n, (P so \ n ) will remain (1) as n increases. 

Similarly, as \l — >• oo, Fig. ^| shows A decreasing in a straight line with slope — 1, 
indicating A = B(l//i). Correspondingly, An = Q(n 2 /m). So if m grows at least 
as fast as n 2 , the probability to find a solution, on average, will remain (1) as n 
increases. 
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Appendix confirms these observations. While such weakly and highly con- 
strained problems are fairly easy, the single-step algorithm outperforms classical 
heuristic methods for these cases, which require evaluating G (n) assignments on 
average. 

n m (P soln ) (S)/2 n 



4 4 0.908 0.569 

9 6 0.897 0.447 

16 8 0.894 0.343 

25 10 0.893 0.263 

36 12 0.892 0.201 



Table 2: Scaling of (P so in) horn Eq. ( p3|) and (S) /2 n for weakly constrained random 3-SAT 
with m = 

As an example, when k = 3, A = a wea k/-t 2 with a wea k = 0.029405, shown as 
the dashed line in Fig. || for the — > limit. For fi = 2/^/n, Table ||] shows the 
approach to the asymptotic limit e~ An = exp(— 4a weak ) = 0.889. This behavior 
compares with the still rapid decrease in expected fraction of solutions which scales 
as (S) /2 n = (1 — 2~ k ) m or (7/8) m for k = 3. Thus the unstructured search scaling 
for this example is (7/8)^™ which still decreases faster than polynomially. 

At the other extreme, Appendix |(J.4| shows that as [i — > oo, A ~ (2 k — l) 3 7r 2 / (lQk 2 fi), 
shown in Fig. |5|. Hence, when m = Q(n 2 ), we have (1) performance. This anal- 
ysis also improves on previous work based on a lower bound estimate [32] showing 
G (1) behavior for highly constrained problems, but only when m grew faster than a 
particular multiple of n 2 (equal to 17.3 for k — 3). 



6 Problem Search Costs 

The ensemble average leading to Eq. (^|) provides a direct analysis of (P so in)- This 
technique generalizes to quantities involving positive integer powers of P S oin, such as 
the variance discussed in § [7.1| . Unfortunately the technique does not apply to quanti- 
ties such as the expected solution cost which, for any particular problem, is l/P S oin for 
independent trials, or (l/v^oin) when combined with amplitude amplification 0. 
Thus an important question is the extent to which an analysis based on (P S oin) pro- 
vides insight into actual search costs, and hence is useful in selecting appropriate 
phase parameters. 

We can approach this question through an empirical evaluation of a sample of 
problems. However, for characterizing the typical behavior of problems, it is impor- 
tant to keep in mind that ensembles with even one problem with no solutions have 
(l/P S oin) = oo. Even restricting consideration just to soluble problems, this ensemble 
average can be dominated by the exceptionally high costs of just a few instances, 
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does not usefully characterize typical search behaviors. A more useful quantity is the 
median of l/P so in, whose properties are even more difficult to determine theoretically 
than the mean. Instead Table |3| compares these quantities based on classical sim- 
ulation. We see that /p 1 , underestimates the median search cost, but is a better 
estimate than (l/P so i n ) even when restricted to soluble problems. 
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Table 3: Comparison of search cost estimates based on 1000 soluble random 3-SAT prob- 
lems using optimal parameter values from Table |]. 

More generally we can examine the full distribution of problem search costs. For 
instance, Fig. |] compares the unstructured method with the combination of amplitude 
amplification with the one-step algorithm. This shows a reduction in cost from using 
problem structure, corresponding with the above discussion of the relative costs, 
in conjunction with Fig. [I], based on the analysis of (P so in)- The behavior of the 
unstructured search depends only on the number of solutions, leading to the vertical 
groups of points in the figure. By contrast, the structured method shows considerable 
variation in costs even among problems with the same number of solutions. 

The full distribution of costs available from empirical evaluation of a sample of 
random A;-SAT problems can address questions beyond those possible by an analysis 
of average behavior. For example, to what extent do classical and quantum methods 
find the same problems particularly difficult? Fig. [7| compares the expected costs 
using the single-step quantum search with a classical heuristic when combined with 
amplitude amplification. Specifically, the expected quantum search cost for a single 
instance is given by l/P so i n - When used with amplitude amplification, the expected 
cost is j\Jl/ Psoin provided P so i n is known, and otherwise is up to 4 times larger [|J. 
For a classical comparison, each problem was solved repeatedly with the GSAT local 
search method using a limit of 2n steps for each trial: if a solution was not found 
after that many steps, a new trial was started. Classically, the expected search cost 
is the ratio of the total number of GSAT steps to the number of solutions found by 
these repeated searches. But when used with amplitude amplification, trials cannot 
end early just because a solution is found, instead they must run to completion (i.e., 
the full 2n steps in this case). While this makes little difference for large problems, 
where most of the cost is due to the many unsuccessful trials typically required before 
a successful trial, it does limit GSAT's benefit from amplitude amplification for the 
smaller problems treated here. The cost of GSAT with amplitude amplification is 
j2nJl/P so \ n where here P so i n is the probability a GSAT trial finds a solution and the 
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Figure 6: Comparison of search costs for the one-step quantum method combined with 
amplitude amplification and unstructured amplitude amplification. Each point shows the 
expected search cost for a single problem instance of random 3-SAT with n = 20 and 
m = 80, assuming the number of solutions and -P so in are known a priori. In practice these 
values will not be known a priori, increasing the costs by up to a factor of 4 |8| . 



factor 2n counts the number of steps for each trial. The one-step method exploits 
amplitude amplification more effectively than GSAT, giving somewhat smaller costs 
shown in Fig. |7|. 

Without combining with amplitude amplification, the absolute number of steps 
required for the one-step quantum method is larger than the classical heuristic for 
these problems. This contrasts with sufficiently weakly or highly constrained problems 
where the quantum method requires G (1) steps while the classical search uses (n). 

The figure also shows a general correlation between search difficulty for the two 
methods, largely reflecting the variation in number of solutions, i.e., both methods 
tend to have higher costs for problems with fewer solutions. Examining just problems 
with the same number of solutions shows little correlation between the two methods. 
This indicates different aspects of problems (beyond their number of solutions) ac- 
count for particularly hard cases for the quantum and classical methods. Identifying 
these different aspects, and hence classes of problems for which quantum methods 
may be particularly well suited, is an interesting direction for future work. Further- 
more, this observation suggests a combination of techniques may be a particularly 
robust approach to combinatorial search, as has been studied for combinations of 
classical methods |T6|, |36|, |24~ . 



As a final note, the cost measure used here is in terms of number of steps, with a 
step corresponding to the evaluation of the conflicts in an assignment. This measure 
is commonly used for general comparisons among search algorithms, especially their 
scaling behavior. However one should also keep in mind the relation among these 
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Figure 7: Comparison of search costs for the one-step quantum method and GSAT, both 
combined with amplitude amplification, for random 3-SAT with n = 20 and m = 80 using 
the same problems as in Fig. ^| Each point shows the expected search cost for a single 
problem instance assuming the probability to find a solution on a single trial of each method 
is known. In practice these values will not be known a priori, increasing the costs by up to 
a factor of 4 ||. Those points below the line use more steps for GSAT than the structured 
quantum method. 

algorithmic steps, more elementary computational operations implemented in hard- 
ware and actual computational time [|32] . This relation depends on the details of the 
hardware, overhead of any necessary error correction, the choice of data structures 
and compiler optimizations. For quantum computers, these details are not yet clear 
but the number of elementary operations to count the number of conflicts in an as- 
signment will be roughly the same for quantum and classical machines. The ratio of 
actual times required for each step on quantum and classical machines will instead 
be mainly determined by the technologically feasible clock rates. 

In summary, the analysis based on (P so i n ) gives a reasonable guide to the typical 
search costs, confirming the improvement of the new algorithm over unstructured 
search (both in performance and a reduction in the required coherence time). Al- 
though comparable with classical heuristics for hard problems, it remains to be seen 
how the behavior seen here for n = 20 scales to larger problems. In particular, 
n = 20 is small enough to be relatively easy for GSAT, with solutions typically found 
in just a few trials thus limiting the extent to which it can benefit from amplitude 
amplification. 
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7 Extensions 



This section describes extensions of the analysis: to compute the variance in P so i n 
among problem instances and the asymptotic behavior of algorithms with more than 
one step. We then discuss how the analysis can be applied to algorithms incorporating 
additional problem structure, specifically the conflicts in partial assignments. 



7. 1 Variance 

The analysis described above gave the asymptotic behavior of the expected value of 
P so i n for random fc-SAT. The technique can also be applied to determine (P S o ln ) and 
hence the variance of these values among different problems. The result has the same 
form as the average, i.e., (P S oi n ) oc e~ nB ^ k ' fl ' p ' T ^ though the analysis is somewhat more 
complicated. Numerical evaluation for a variety of cases gives B slightly smaller than 
2A, where A is the decay rate for (P so in)- Thus the scaling of the variance, (P S oi n ) — 
(Psoin) 2 is dominated by e~ nB and the standard deviation scales as e~ nB ^ 2 , hence 
decreasing slightly slower than the average. This observation leads to a relatively large 
spread in the distribution of P so in values among different problems, corresponding to 
the large variations seen in §§| 

In addition to indicating how close to the average instances are likely to be, evalu- 
ating the variance could be used as an alternate basis for selecting the phase parame- 
ters, namely to minimize the variance even at the expense of somewhat worse average 
performance. Algorithms with different tradeoffs between variance and average could 
then be usefully combined in a portfolio approach BO, 



7.2 Multiple Steps 

The techniques used in Appendix extend to algorithms using more than one step, 
provided the number of steps remains fixed as n increases. However the detailed 
analysis becomes more complicated since j steps requires the relationships among 
2j + 1 assignments, generalizing Fig. ^ in Appendix |C|to 2 2j variable groups. Thus 
computational time required to evaluate the exact asymptotic behavior grows very 
rapidly with j, limiting the practical utility of this technique to relatively small values 
of j. For larger j, and in particular when j increases with n, other techniques will 
be necessary. Nevertheless, the exact behavior as n —>■ oo for a few small values of j 
may suggest useful directions for designing improved algorithms. 

Multiple steps also introduce additional parameters: different values of p and r 
can be used for each step. The simplest approach, taking the same values for all 
steps, gives only modest reductions in the decay rates compared to a single step. 
On the other hand, allowing independent values gives larger reductions but requires 
numerical optimization of the decay rate with respect to 2j parameters p^ h > and 
for steps h — 1, . . . , j. 
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Figure 8: Minimum decay rates for multiple steps as a function of p for (from top to 
bottom) 1 through 5 steps using linear variation in phase parameters with step number. 
The curve for the 1-step case is the same as shown in Fig. |l]. The points indicate empirical 
estimates of the decay rate for P S oiubie) a lower bound on the decay rate for P so i n - The upper 
edge of the filled region is, in turn, a lower limit on -P so iubie given by the Markov bound. 



Evaluating optimal parameters for up to 4 steps gives values whose variation is 
nearly linear with the step number. In fact, restricting consideration only to param- 
eters with linear variation, i.e., of the form p^ = pa + hp-Q and = ta + Iitb gives 
a decay rate very close to that achieved when parameters are optimized individually 
for each step. This linear form only requires optimizing over the four values pa, Pb, 
ta and tb no matter how many steps are involved. 

As an example, for p = 4 and j = 4 steps the optimal decay rate is numerically 
found to be 0.128. Restricting the parameters to vary linearly, gives only a slightly 
larger value: 0.129. However, requiring the same values for each step gives a consid- 
erably larger decay rate: 0.211. These values compare with A = 0.280 for the 1-step 
method given in Table [1]. 

By comparison, as described in § ^4| the decay rate of the unstructured search 
is unchanged by any fixed number of steps: it decreases only when j grows with n, 
reaching when the number of steps grows exponentially with n since in that case it 
achieves P so i n ~ 1. 

Fig. H shows the behavior of the optimal decay rate, restricted to linear variation 
in the phase parameters, for various p values for j from 1 to 5. As with the 1-step 
method, a further quadratic improvement is possible by combining these methods 
with amplitude amplification, corresponding to dividing these decay rates by 2 for 
soluble cases. 

As with the discussion of Fig. [l|, for p above the transition point, removing the 
portion of the decay due to the insoluble problems shows most of the increase past 
the transition is due to the insoluble problems. In fact, the decay rate corresponding 
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Figure 9: Scaling of minimum decay rates vs. number of steps, for \x equals 2 (black), 4 
(gray) and 6 (dashed). 

to random soluble problems reaches a maximum and then decreases in the range of 
fj, between 4.5 and 4.9, and this point of maximum difficulty for soluble problems 
decreases slightly as more steps are considered. This suggests the quantum method 
has maximum difficulty for soluble problems close to the transition point, as is the 
case for incomplete classical methods. Significantly, this observation indicates the 
quantum method is exploiting the underlying problem structure, as with classical 
heuristics, in contrast to the unstructured quantum search. 

Fig. |9| is an alternate view of the decrease in decay rates as a function of number 
of steps. This figure raises the significant question of whether the decay rates ap- 
proach zero as j '• —>■ oo for soluble problems, and if so, how rapidly. On the log-log 
plot, straight lines correspond to powerlaw behavior, so this figure suggests the decay 
rates decrease as a power of the number of steps. Although this range of j is too 
small for definite conclusions, using the number of conflicts in assignments may give 
high performance, on average, when the number of steps grows only as a power of n. 
This would contrast with the exponential growth in j required by the unstructured 
algorithm. At any rate, the reduction in decay rate with j shows again that using 
conflict information allows using superpositions more effectively than the unstruc- 
tured method where, as described in § |3.4j , the decay rate is not improved by any fixed 
value of j. 

7.3 Using Structure in Partial Assignments 

The algorithm presented above adjusted phases based on the number of conflicts 
in each assignment. Classical heuristics often use additional properties to evaluate 
search states. For the quantum algorithm, these properties are readily included by 
additional phase adjustments. 
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As an example, this section considers the enlarged search space of partial assign- 
ments, i.e., states in which only some of the variables have assignments, as used in 
classical backtrack searches. In many search problems, including SAT, conflicts can 
often be recognized before all variables are assigned, immediately pruning all search 
states involving extensions to the partial assignment. This additional pruning often 
more than compensates for the larger overall number of search states. However, its 
effectiveness depends crucially on the order in which variables are assigned and, for 
each variable, the order in which each possible value is tried. 

One quantum approach for using the information in partial assignments considers 
all possible variable orderings simultaneously f30fl . This in turn requires superposi- 



tions of all 2 2n sets of variable- value pairs, including sets with multiple values for some 
variables, the so-called necessary nogoods This larger search space can readily 
represent more general constraint satisfaction problems, such as variables with dif- 
ferent sized domains. Although proposed as a multi-step algorithm, in analogy with 
classical backtrack searches that attempt to build a solution by extending partial 
assignments, for simplicity we consider here its behavior with a single step, starting 
from an initial superposition with equal amplitude for each set. 

With this representation of the problem, the goal is finding a set in which each 
variable appears exactly once and which has no conflicts with the clauses of the 



SAT problem. The simplest approach modifies the phase matrix P of § |3.1| so P. 



Pc(s)£ m<Tq ^ where q(s) is the number of variables in each set with a unique assigned 
value, ranging from to n, and a is an additional parameter for the algorithm. 
Furthermore, to focus on the information available with partial assignments, c(s) is 
defined as the number of conflicts among only the uniquely-assigned variables. A 
solution is a set with q(s) = n and c(s) = 0. 

The asymptotic analysis proceeds as in Appendix with two modifications. First 
an additional factor of 2~ n appears in P so i n due to the increased search space size. 
Second, the algorithm distinguishes among sets depending not only on the assigned 
values but also on the number of uniquely-assigned variables, giving nine groups of 
variables instead of the four used in Fig. [TJ] of Appendix |C[ With these changes, 
the asymptotic analysis proceeds to give (P so i n ) oc e~ nA where now the decay rate A 
depends also on the additional phase parameter a. 

For this algorithm, Fig. [10] shows the minimum decay rate for (-P S oin)> and com- 
pares it to random selection among complete assignments and the one-step quantum 
algorithm on complete assignments of Fig. |l|. The resulting behavior is worse than 
the complete-assignment algorithm, but by significantly less than the addition of 
log(2) = 0.69 that one might expect just based on the increase in search space size by 
a factor of 2 n . Thus we conclude the information available from partial assignments 
helps concentrate amplitudes toward solutions, but not sufficiently to overcome the 
handicap of the much larger search space, at least in a single step. 

Importantly, the analysis technique introduced here gives a more definitive asymp- 
totic characterization of the algorithm than is possible from empirical simulations. In 
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Figure 10: Minimum decay rate for a single-step algorithm with partial assignments as a 
function of [i (dashed). The solid curve, showing the behavior of the algorithm on complete 
assignments, and the gray curve, showing random selection, are the same as shown in Fig. [jj 
The points indicate empirical estimates of the decay rate for P S oiubie) a lower bound on the 
decay rate for P so i n - The upper edge of the filled region is, in turn, a lower limit on P so i u bie 
given by the Markov bound. 

turn, this characterization identifies good parameter choices for the phase adjustments 
that would be difficult to estimate from simulations. Moreover, it allows comparing 
the benefits of different approaches to using the information available in partial assign- 
ment. For example, basing the phase adjustments on the total number of conflicts in 
a set of variable- value pairs, including those involving duplicate variables, gives worse 
performance than just counting conflicts among uniquely-assigned variables, an ob- 
servation not obvious a priori. Similarly, performance is not improved by allowing 
the phase adjustments to depend separately on the numbers of doubly-assigned and 
unassigned variables, rather than just their sum. On the other hand, generalizing 
the matrix T used to form the mixing matrix U in § |3.1| , so its elements T rr have a 
phase adjustment based on the number of duplicate variables in the set r in addition 
to the size of the set, gives a slight improvement in performance suggesting a further 
study of using the structure in this larger search space may be useful, particularly for 
multiple steps. 

8 Discussion 

We have shown how an analysis based on ensemble averages helps design quantum 
search algorithms. The result was evaluated for satisfiability problems in the hard 
region as well as the easier weakly and highly constrained cases. Compared to unstruc- 
tured search, this gives exponentially better average behavior. Moreover, this perfor- 
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mance uses only a single evaluation of the assignment properties rather than the expo- 
nentially large number of repeated evaluations required by the unstructured method. 
Thus this single-step algorithm requires much less coherence time for the quantum 
operations. The algorithm can be combined with amplitude amplification ||, giving 
an additional quadratic performance improvement, but then requiring coherence time 
extending for the full algorithm rather than just for each trial separately. 

Classical heuristics use more problem properties than the algorithm described 
here. These properties include the difference between the number of conflicts in an 
assignment and those of its neighbors, and the conflicts associated with partial as- 
signments. We illustrated how additional phase variation allows incorporating such 
information in the quantum algorithm, and how the analyses techniques developed 
here can be extended to identify suitable parameters. Thus these techniques can help 
evaluate a variety of quantum algorithms that are not easily addressed theoretically 
and hence would otherwise require slow classical simulation. This evaluation requires 
only that the properties of assignments used by the algorithm and the nature of the 
ensemble allow for an explicit determination of the ensemble averages, in analogy 
with Eq. (^). Furthermore, in many respects this analysis is simpler than that for 
heuristic classical methods. This is because classical searches introduce dependencies 
in their path through a search space based on a series of heuristic choices. These 
dependencies are difficult to model theoretically. By contrast, the quantum search, 
by in effect exploring all search paths simultaneously, avoids this difficulty thereby 
giving relatively simple analytic expressions for the average behavior. On the other 
hand, this analysis is restricted to simple quantities, such as the average probabil- 
ity of finding a solution. How well this reflects typical search costs remains to be 
seen, though the discussion of §0 suggests it gives a reasonable estimate, as well as 
determining good parameter values. 

Classical heuristics often rely on behavior of states near solutions as guides, and 
can become stuck in local minima or among large collections of assignments with 



the same number of conflicts [^T[ . For the quantum algorithm, local minima are not 
an issue: instead the limited correlation between distance and conflicts for states far 
from solutions prevents efficient search. Because of these very different characteristics, 
an interesting direction for future work is identifying individual problems or problem 
ensembles where the correlations are stronger even though the local minima for states 
relatively near solutions remain. In such cases, quantum algorithms could perform 
much better than classical heuristics. 

An important advantage of basing the algorithm on ensembles is the use of av- 
erages rather than requiring detailed knowledge of an individual search problem. 
This contrasts with the unstructured search method which requires knowledge of the 
number of solutions for a particular problem, or various values must be tried repeat- 
edly ||. An interesting open question is whether the algorithm, e.g., the choice of 
p and r, could be improved by adjusting the parameters prior to search based on 
readily computed characteristics of an individual problem instance. In effect this 
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would amount to using a more specific ensemble whose instances are more likely to 
be similar to the given instance than random problems. More generally, the variation 
in performance suggests a portfolio approach |36. 24 1 would be effective for combin- 
ing quantum algorithms using different parameter choices along with various classical 
methods. 

Another possibility is combining this quantum algorithm more directly with clas- 
sical heuristics consisting of independent trials, just as is possible for amplitude am- 
plification. In this case, the heuristic is described not just by the probability to find 
a solution but by the probabilities it finds assignments with various numbers of con- 
flicts, enhancing assignments with relatively few conflicts. Then instead of starting 
with a uniform superposition of assignments, the initial state for the corresponding 
quantum algorithm would have amplitudes proportional to the square root of these 
probabilities. If the probabilities have a simple analytic form, the asymptotic anal- 
ysis could be repreated, allowing optimal selection of the phase parameters for use 
with the classical heuristic. Otherwise samples of the classical heuristic's behaviors 
could be used to estimate the relevant probabilities. While the resulting analysis 
will be more complicated than for amplitude amplificiation with nonuniform initial 
state 0, H |23] , using additional information in the quantum operations (namely the 
number of conflicts in assignments rather than just whether they are solutions) may 
allow for similar improvements as seen here for uniform initial conditions. 

These results show the usefulness of ensemble-based analyses for designing quan- 
tum algorithms. This is particularly helpful because empirical evaluation, through 
classical simulation, is limited to small cases. Because quantum algorithms use prop- 
erties of the entire search space, not just a small, carefully selected sample as with 
classical heuristics, ensemble averages are likely to be more useful for quantum algo- 
rithm development than is the case classically. Thus quantum computing is likely to 
benefit from continued study of the properties of search problem ensembles, particu- 
larly for developing heuristic methods that work well for typical problems. 
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A Random &-SAT 



Random fc-SAT problems are defined by the number of variables n and the number 
of distinct clauses m. Random instances are readily generated ||41|| . This ensemble 



differs somewhat from other studies where the clauses are not required to be dis- 
tinct. Asymptotically, when m <C n fc//2 , as is the case for hard random fc-SAT where 
m = 9 (n), this difference is not important: in such cases, even if duplicate clauses 
are allowed, instances are very unlikely to have any duplicates. However, for highly 
constrained problems or small problem sizes, these ensembles have different behav- 
iors, though qualitatively still fairly similar. In particular, for small sizes, including 
duplicate clauses considers essentially the same problem in samples with different 
values of m, somewhat increasing the sample variation. 

The ensemble of random fc-SAT with n variables has M = {^j2 k possible clauses 
to select from and 

^problems = 0-®) 

possible problems with m clauses, each of which is equally likely to be selected. 

For random fc-SAT, the number of conflicts in assignments is increasingly con- 
centrated around the average as n increases. To see this, let c(s) be the number of 
conflicts in assignment s for a particular problem. The average number of conflicts 
in assignments is 

g = 2-x; c W = 2 " n EEx(«,*) (ii) 

s s a 

where s ) is 1 if assignment s conflicts with clause a and the inner sum is over all 
m clauses appearing in the problem. Interchanging the order of summation gives an 
inner sum J2 S x{ a i s )i the number of assignments conflicting with a given clause 
a, namely 2 n ~ k . Thus c = S Q 2 _fc = m2~ k for every fc-SAT instance with m clauses. 

The variance var(c) = c 2 — c 2 characterizes the spread around this average. We 
have 

c"2 = 2- J2 c(s) 2 = r n EE s) X (a', s) (12) 

s a ,a' s 

The inner sum counts the number of assignments that conflict with both clauses a 
and a', which in turn depends on the number of variables S these two clauses have in 
common. If any common variable is negated in one of the clauses but not the other, 
then no assignment can conflict with both so such clause pairs make no contribution to 
the sum. Otherwise, the two clauses require a specific value for each of 2k — 5 variables 
in assignments conflicting with both, giving 2 n ~ 2k+s such assignments. Thus 

c 2 = 2- 2fe ^2 5 iV clausepairs (5) (13) 

<5 

where iV clause pairs (5) is the number of contributing clause pairs with 5 variables in 
common for the given problem instance, a can be any of the M possible clauses 



26 



but a' must then be selected from among only (jj (j^zfj^ 6 to have 5 variables in 
common with k and contribute to the sum. 

The value of c 2 differs among problem instances, so we consider its average value 
for random fc-SAT. When 5 = k, so the two clauses are identical, there are \ M ~}) 

' ' \m— 1 J 

m n) problems containing 

the two clauses. Collecting these contributions then gives 

l + (m-l) M _ 1 j (14) 

For large n, M > 1 so the variance becomes 

(var(c)) ~ m2- k - m2~ 2fc = c avg (l - 2~ fc ) (15) 

B Mixing Matrix 

The form for the mixing matrix given in Eq. @ follows from Eq. (|j) with the choice 
of Eq. (|7|). To see this, replacing h! = h — z in Eq. (|]) and using the binomial theorem 
gives 

Ud = rv^^ 1)2 g|;f- rf ) e ^, ( 16) 

= 2- n e- i7TTn/2 (l - e inT ) d (l + e i7IT ) n ~ d 

which simplifies to Eq. (|). 

The linearized phases allow a particularly simple implementation of the mix- 
ing matrix. Specifically, Eq. (0) can be written as an overall phase e~ t7TTn ^ 2 times 
n"=i e tnrSj where Sj is the value, or 1, of the j-th bit of assignment s (so J2j s j — \ s \)- 

Thus these phases can be introduced by operating with ( n i7rr \ independently on 



e l 



each bit. 



C Asymptotic Behavior of the Algorithm 

After completing the algorithm, the amplitude in assignment r is 

0r = U rsPs^ = 2^/2 ^d(r,s)Pc(s) ( 1 7) 

Let x( s 5 c ) be 1 if the assignment s has c conflicts, and otherwise x( s i c ) — 0. The 
probability to find a solution is 

Psoln= E \<Pr\ 2 = J2\^\ 2 X(r,0) (18) 

{r\r is a solution} r 
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Figure 11: Clause selection for counting problems contributing to iVp ro biems({f, s, s'}, b, b'). 
The regions correspond to groups of the M possible clauses based on their conflicts with 
r, s and s'. The dark gray region represents clauses that conflict with r and so cannot 
be selected. The white regions represent clauses that conflict only with one of s or s' . 
Contributing problems consist of m clauses such that b and b' conflict only with s and s', 
respectively, and the remaining m — b — b 1 clauses conflict with both s and s' or with neither 
(light gray region). 



This appendix derives the asymptotic scaling behavior of this quantity, averaged 
over the ensemble of random /c-SAT problems. To do so, we first derive an exact ex- 
pression for (P S oin) in terms of the numbers of problems constrained to have specific 
numbers of conflicts with given assignments. This result consists of a sum of quan- 
tities involving binomial coefficients. For large problems, the expression simplifies 
using Stirling's formula. Expressing the resulting sum as an integral then gives the 
asymptotic scaling behavior. 



C.l Average Behavior 

Using Eq. (|ITD and flTH), the average probability of finding a solution is 



1 



(^soln) = y L Y,J2 U d(r,s)U* d (r,s')Y,PcP*c' (x0> C )x0', C ')x( r ) °)) ( 19 ) 



r ss' 



The expected value (x(s, c)x(s', d)x(r, 0)) is just the fraction of problems for which r 
is a solution and s and s' have, respectively, c and d conflicts. Let a be the number of 
conflicts s and s' have in common, and let b = c — a and b' = d — a be their respective 
numbers of distinct conflicts. With Eq. (||), the inner sum over c and d in Eq. ( |19D 
becomes 

£ jMb-b') £ (x(S) b + a)x{s ^ y + a)x(r . 0)) ( 20) 

W a 

The sum over a just gives the fraction of problems A^ prob i ems ({r, s, s'}, b, /-^problems 
for which r is a solution and s and s' have, respectively, b and b' distinct conflicts, 
-^problems s, s'}, b, U) is the number of ways m clauses can be selected from the M 
available to satisfy the conditions on r, s and s'. 
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Figure 12: Grouping of variables based on assigned values in r, s and s' , each shown as 
a horizontal box schematically indicating values assigned to each of the n variables. In 
each assignment, the value given in r to a variable is shown as white, while black indicates 
the opposite value. In this diagram, variables are grouped according to the differences in 
values they are given in the three assignments. For instance, the first group, consisting of 
w variables, has those variables assigned the same value in all three assignments. 



The possible clause selection is illustrated in Fig. [11]. For given assignments r, s 
and s', group those clauses that do not conflict with r as follows. Let N s and N s > be 
the number of clauses that conflict only with s and s', respectively, and A^ ot her the 
number that do not conflict with r and conflict with both or neither of s and s' (the 
light gray region in Fig. |T1|). Then we have 

^.«.(^^'},M')^p-)pr)( m !t%) < 21 » 

as the number of problems for which s and s' have, respectively, b and b' unique 
conflicts, and r is a solution. 

C.l.l Clause Group Sizes 

Now consider the n variables in four mutually exclusive groups based on the values 



they are assigned in r, s and s', as illustrated in Fig. [12]: 

1. the w variables with the same values in all three assignments 

2. the x variables with the same value in r and s, but opposite value in s' 

3. the y variables with the same value in s and s', opposite that of r 

4. the z variables with the same value in r and s', but opposite value in s 

As an example with n = 5, suppose r = 00000, s = 10011 and s' = 00111. The first 
variable has the same assignment in r and s', but the opposite value in s, and is the 
only such variable, so z = 1. The second variable is the only one with the same value 
in all three assignments, so w = 1. Similarly, x — 1 and y = 2. 
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Assignment r conflicts with (j^J clauses leaving M — 0£\ clauses available for 



selection. The principle of inclusion and exclusion [[42] gives the number of available 
clauses that conflict with 

• both s and s' is the number that conflict with both s and s' minus the number 
of those that also conflict with r: 



N hoth = r; y - r ^ 




s only is 

k \ k 



Ns= n _ w + x \_ NhQth 



s' only is 



f? )-rtl-N tMh (22c) 



both s and s' or with neither is 



k 



Mother = I™ (2 fe - 1) - iV s - iV s , (22d) 



Through the expressions of Eq. (f?2]), iVprobiems({ r , s, s'}, 6, 6') given in Eq. ( pi] ) 
depends on w, x, y and but otherwise is independent of the choice of assignments 
r, s and s'. We denote this value as iVprobiemst^, y, z; b, V) since w = n — x — y — z is 
determined by the remaining group sizes. Furthermore, d(r, s) = y + z and d(r, s') = 
x + y. Thus, in Eq. fljjD the sum over the assignments s and s' becomes a sum over x, 
y and z times the number of ways to pick s and s' with assigned values matching each 
other and those of r as specified by the values of w, x, y and z. This latter quantity 
is just the multinomial coefficient ( n ). Finally, because the quantities in the sum 

J \w,x,y,zj ^ 

depend on u>, x, y and z but not the specific choice of the assignment r, the sums 
can be rearranged to move all terms outside of the sum over r. This leaves the inner 
sum as J2r 1 which just counts the number of assignments, i.e., 2™, and cancels the 
factor 2~ n appearing in Eq. (|i~9|). Thus for the ensemble of random fc-SAT, Eq. ( |19|) 
becomes 

= £ L )^< + ,S^ (i - y) ^#^^ (23) 

xyz \ u ') i/5 */ fob' iv problcms 
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C.1.2 An Example 

To illustrate this counting argument, consider n = 3, k = 2 and m — 3. This example 
has M — 12 possible clauses and hence iVp ro bi om s = f^ 2 ) = 220. For assignments 
r = 000, s = Oil and s' = 110, how many of these problems have no conflicts 
with r, b = 1 conflict only with s and b' = 2 conflicts only with s'7 For these 
assignments, w — 0, i.e., there are no variables with the same assigned value in all 
three assignments, and x — y — z — 1. From Eq. (^2|), we then have iVboth = (no 
clauses conflict with both s and s' since they share no pair of variables with the same 
values), N s = N s > = = 3 and N other = 3. 

Thus Eq. ( |2~ll) gives A^ pro bi e ms({ r , s, s'}, 1, 2) = 9. An example is the problem with 
the following three clauses: V x OR (NOT V 2 ), (NOT V{) OR (NOT V 2 ), and (NOT 
Vi) OR V3. None of these clauses conflict with r. The first clause conflicts only with 
s, so b = 1, and the last two conflict only with s', so b' = 2. 



C.2 Asymptotic Behavior 

For random fc-SAT, Eq. (El) gives the exact value for (P so i n ). As n — >• 00, the 



discussion in §3.2 indicates the main contributions are from assignments with close 



to the average number of conflicts, c avg = m/2 . That is, terms for which b and b' are 
O (m). We thus use the scaled values b = b/m and b' = b' /m to simplify the analysis. 
Similarly the main contribution in the outer sum comes from values of d(r, s) = y+ z 
and d(r, s') = x + y close to n/2. This suggests defining w = w/n,...,z = z/n. As we 
will see below, these are indeed the appropriate scaling behaviors for the dominant 
contributions to the sum. 



C.2.1 Sum over Conflicts 

With w,x,y,z scaling as O (n), the number of possible clauses M and each value in 
Eq. © scale as Q = 9 ( n ) . This value is much larger than the actual number of 
clauses m that appear in hard problems, for which m = O (n). We thus consider 1 
m <ti n k . A convenient scaling for the numbers of available clauses is N„, = N_/M 
so that 



-^both — 


(w 


+ y) k — w k 






2 k 




N s = 


1 - 


- (w + x) k 


-^both 




2 k 


N s , = 


1 - 


- (w + z) k 


IVboth 




2 k 


-Mother = 


1 - 


2- k -N s - 





(24) 



with corrections of order 1/n. 
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When S <C \/R, (^) ~ R s /S\. Using Stirling's formula [I] with this expression 



then gives e s+slo ^ R ^ /y/^S. Thus for m < n fc / 2 , iV probloms (:r, y, 2; 6, 6') /iV problems 
e mX rrr l Y where 

X = Slog^ + 6'log^ + (1 - S - 6') log iVother . (25) 
6 6' 1-6-6' 



and 



Y = W = = (26) 

2tt v / 66'(1 -6-6') 



For the inner sum of Eq. (p3|), this quantity is multiplied by exp(«7rmp(6 — 6')) and 
summed over 6 and 6'. When m is large, this sum can be approximated by an integral 
over the scaled variables 6 and 6'. Converting to an integral introduces a power of m 
for each variable, so the inner sum is asymptotic to 

m J dbdb'Yexp(m(X + nrp(b-b'))) (27) 

The asymptotic behavior of this integral as m — > 00 is readily evaluated by the 
method of steepest descents 0. This involves considering complex values for the 
integration variables and noting that the value of the integral is dominated by its 
behavior around a stationary point, i.e., values for 6 and 6' for which X + mp{b — 6') 
has zero derivatives with respect to 6 and 6'. Specifically, the integral is asymptotic 
to the value of the integrand at the stationary point multiplied by 2nmT 1 / V — det D 
where D is the matrix of 2nd derivatives of X + mp{b — V) evaluated at the stationary 
point, and det D is its determinant. Evaluating these derivatives then shows the inner 
sum is asymptotic to exp(m/) with 

/ = log (e i7Tp N s + e-™ p N s> + iV other 



which depends on w, . . . , z through Eq. 

This derivation assumed m y/nJ'. When m is larger than this, the binomials 
give additional contributions. However, if p is small, specifically of order n/m, these 
additions do not change the final asymptotic result. As described below, this behavior 
of p is the appropriate choice for m > n so we use this result over the full set of scaling 
behaviors for m. 

C.2.2 Sum Over Variable Groupings 

For the remaining sum in Eq. (p3|) , over x, y and z, Stirling's formula gives 




exp(nH)n- 3/2 J- — J (29) 



32 



with the entropy 

H = —w log w — ... — z log z 



(30) 



and w = l — x — y — z. 

The u y+z u* x+ factors are exp(nU) with 



U = 2(3 C + P(x + 2y + z) + m(x - z)/2 



(31) 



where, from Eq. (§) 



/5c 




log(cos(7rr/2)) 
log(tan(7rr/2)) 



(32) 



Combining these values with the result from the inner sum again gives a sum that 
can be approximated by an integral. After changing to scaled variables this becomes 



with w = 1 — x — y — z. The method of steepest descents applies to this integral. Thus, 
its asymptotic behavior is determined by the stationary point, namely the values of 
x, y and z for which the derivatives of n(H + U) + ml with respect to these three 
variables are zero. Let A be the corresponding 3x3 matrix of 2nd derivatives and 
A the value of — (H + U + Im/n), both evaluated at this point. The asymptotic 
behavior is then 



evaluated at the stationary point. These quantitites depend on the parameters k, 
p = m/n, p and r. 

The stationary point has no simple closed form but is readily evaluated numer- 
ically. For example, with /i = 4 and parameters r = 0.286, p = 0.218 used in 
Table [I], the stationary point is at w = 0.710, x = 0.101 + 0.158i, y = 0.088 and 
z = 0.101 - 0.158z with A = 0.280 and det A = -478.5, so (P soln ) ~ 0.98e-°- 28n , 
corresponding to the p = 4 curve in Fig. ^. 

C.3 Weakly Constrained Problems 

When 1 < m >C n, the decay rate A = —(H + U + Im/n), can be treated through an 
expansion in the small quantity p = m/n. Specifically, the location of the stationary 
point for the variables x, y and z is determined, to (1), by setting the derivatives 
of H + U to zero. The contribution from the sum over conflicts, pi, only introduces 
corrections of O (p). 

The expression H + U evaluated at the G (1) values for the stationary point is 
zero, for any choice of the parameters p and r. The O (p) values and the contribution 



(-Psoin) ~ n 3/2 [ dxdydz J \ — 
J y {zirywxyz 



exp(n(H + U) + ml) 



(33) 




(34) 
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from pi then give a scaling for (P so i n ) of exp(G(m)). However, for an appropriate 
choice of r and p, the coefficient of this G (m) term can be set to zero, so that the 
actual scaling is dominated by the G (/i 2 ) correction, i.e., exp(Q (np 2 )) corresponding 
to the behavior seen in Fig. |5|. 

The parameter values eliminating the G (m) decay do not have a simple closed 
form. They are determined by trigonometric equations arising from setting to zero 
the derivatives of the G (m) contribution with respect to r and p. The optimal choice 
for r satisfies 

2cosfc (t) 003 ^^) = 1 (35) 

With this value for r, p must then satisfy sin(7r(p + kr)) = 0. Among the many 
possible solutions for p and r, we select the one in the range to 1 for definiteness. 
For k = 3 these equations give r = 0.201389 and p = 0.395832, which correspond to 
the limiting values in Fig. |2] as m/n — > 0. 

These values of the parameters and the corresponding stationary point in Eq. ( p4[) 
give (P so in) ~ e _Qwcakm /n , where a wea k is determined numerically, with the correspond- 
ing decay rate A = a wea k/x 2 . 



C.4 Highly Constrained Problems 

For m > n, we focus on the ensemble of problems with a prespecified solution. In fact, 
with this many clauses, there are relatively few solutions in addition to the prespecified 
one. Thus a simpler evaluation considers the probability that the quantum search 
finds the prespecified solution, rather than any solution. This quantity is a lower 
bound on (P so in), and is a tight bound when m^> n. 

This lower bound is given by setting assignment r in Eq. ([19]) to be the prespecified 
solution rather than summing over all possible assignments. The derivation leading 
to Eq. (|23|) proceeds as before except for two changes. First, eliminating the sum 
over r gives an additional factor of 2~ n . Second, the number of possible problems 
^problems is replaced by 

( m - Cd\ 

\ m I 



(36) 



reflecting the smaller number of problems with a prespecified solution. 

The asymptotic analysis gives an additional overall factor of 2~ n (l — 2~ k )~ m . 
The resulting decay rate for (P so i n ) then has an upper bound given by the value of 
— (H + U — log 2 + {I — log(l — 2~ k ))m/n) evaluated at the corresponding stationary 
point. 

For m ^> n, we can expand the stationary point evaluation in powers of l//i. 
Following the behavior for the optimal value of p suggested by Fig. |2| and the values 
obtained in connection with Fig. ^ we take p to be proportional to The Q (1) 
values for the stationary point in this case are simply x = y = z = 1/4. Because 
p — ► 0, the leading behavior for / is just log(l — 2~ k ) so the exponential scaling 
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of (m) is exactly zero. The contributions to the scaling of (n) can be made 
equal to zero by selecting r = 1/2 and p = 2 k ~ 2 (2 k — l)/(kp). The simple form for 
the (1) stationary point values also allows evaluating the overall (1) asymptotic 
behavior. Specifically, with these parameters, the scaling of the probability to find 
the prespecified solution is 

4 / ( 2 fc -l) 3 7T 2 n 2 

y/16 +{k- 1) 2 7T 2 ^ V 16k2 171 

The corresponding decay rate is A = {2 k — l) 3 7r 2 /(16/c 2 /i). 
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